RAW SEQUENCE LISTING

PATENT APPLICATION US/08/836,075A 



DATE: 02/04/98 
TIME: 15:55:31 



INPUT SET: S23159.raw 



This Raw Listing contains the General 
Information Section and up to the first 5 pages. 




SEQUENCE LISTING 




General Information: 



(i) APPLICANT: MAERTENS, GEERT 
STUYVER, LIEVEN 



(ii) TITLE OF INVENTION: NEW SEQUENCES OF HEPATITIS C VIRUS GENOTYPES
AND THEIR USE AS PROPHYLACTIC, THERAPEUTIC AND DIAGNOSTIC 
AGENTS 

(iii) NUMBER OF SEQUENCES: 207 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: ARNOLD, WHITE & DURKEE 

(B) STREET: P.O. BOX 4433 

(C) CITY: HOUSTON 

(D) STATE: TEXAS 

(E) COUNTRY: USA 

(F) ZIP: 77210-4433 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Microsoft Word 6.0 / ASCII text output 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/836,075 

(B) FILING DATE: 21 Apr 1997 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/EP95/04155 

(B) FILING DATE: 23 Oct 1995 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: EP 94870166.9 

(B) FILING DATE: 21 Oct 1994 

(viii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: EP 95870076.7 

(B) FILING DATE: 28 Jun 1995 



(ix) ATTORNEY/AGENT INFORMATION:

(A) NAME: KAMMERER, PATRICIA A. 

(B) REGISTRATION NUMBER: 29,775 

(C) REFERENCE/DOCKET NUMBER: INNS: 004 
(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 327 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

ATGAGCACGA ATCCTAAACC TCAAAGAAAA ACCAAACGTA ACACCAACCG CCGCCCTCAK 60 

GGSGTNNNNN NNCCGGGTGG CGGTCAGATC GTTGGTGGAG TTTACCTGTT GCCGCGCAGG 120 

GGCCCCAGGN NGGGTGTGCG CGCGACTAGG AAGACTTCCG AGCGGTCACA ACCTCGTGGC 180 

AGGCGACAGC CTATCCCCAA GGCTCGYCGG YCCGAGGGCA GGTCCTGGGC TCAGCCCGGG 240 

TATCCTTGGC CCCTCTATGG CAATGAGGGC TGCGGGTGGG CGGGNTGGCT CCTGTCCCCC 300 

CGCGGCTCTC GGCCCAATTG GGGCCCC 327 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn
1 5 10 15

Arg Arg Pro Xaa Xaa Xaa Xaa Xaa Pro Gly Gly Gly Gln Ile Val Gly
20 25 30

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Xaa Gly Val Arg Ala
35 40 45

Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60

Ile Pro Lys Ala Xaa Arg Xaa Glu Gly Arg Ser Trp Ala Gln Pro Gly
65 70 75 80

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Xaa Trp
85 90 95

Leu Leu Ser Pro Arg Gly Ser Arg Pro Asn Trp Gly Pro
100 105

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 447 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iii) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

GACGGCGTGA ACTATGCAAC AGGGAACTTG CCCGGTTGCT CTTTCTCTAT CTTCCTCTTG 60

GCTTTGCTGT CCTGCTTGAC GGTTCCAACK ACCGCTCACG AGGTGCGCAA CGCATCCGGG 120

GTGTATCATG TCACCAACGA CTGTTCCAAC TCGAGCATCA TCTATGAGAT GGACGGTATG 180

ATCATGCACT ACCCAGGGTG CGTGCCCTGC GTTCGGGAGG ATAACCATCT CCGCTGCTGG 240

ATGGCGCTCA CCCCCACGCT TGCGGTCAAA AAYGCTAGTG TCCCCACTRC GGCAATCCGA 300

CGTCACGTCG ACTTGCTTGT TGGGGGNNCC ACGTTCTGTT CCGCTATGTA CGTGGGRGAC 360

CTTTGCGGGT CTGTCTTCCT CGCTGGCCAG CTATTCACCT TTTCACCCCG CATGCACCAT 420

ACAACGCAGG AGTGCAACTG CTCAATC 447

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 149 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Asp Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser
1 5 10 15

Ile Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Xaa Thr Ala
20 25 30

His Glu Val Arg Asn Ala Ser Gly Val Tyr His Val Thr Asn Asp Cys
35 40 45

Ser Asn Ser Ser Ile Ile Tyr Glu Met Asp Gly Met Ile Met His Tyr
50 55 60

Pro Gly Cys Val Pro Cys Val Arg Glu Asp Asn His Leu Arg Cys Trp
65 70 75 80

Met Ala Leu Thr Pro Thr Leu Ala Val Lys Xaa Ala Ser Val Pro Thr
85 90 95

Xaa Ala Ile Arg Arg His Val Asp Leu Leu Val Gly Xaa Xaa Thr Phe
100 105 110

Cys Ser Ala Met Tyr Val Xaa Asp Leu Cys Gly Ser Val Phe Leu Ala
115 120 125

Gly Gln Leu Phe Thr Phe Ser Pro Arg Met His His Thr Thr Gln Glu
130 135 140

Cys Asn Cys Ser Ile
145

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 327 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iii) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

ATGAGCACGA ATCCTAAACC TCAAAGAAAA ACCAAACGTA ACACCAACCG CCGCCCACAG 60

GACGTCAAGN TCCCGGGTGG TGGTCAGATC GTTGGTGGAG TTTACCTGTT GCCGCGCAGG 120

GGCCCCAGGT TGGGTGTGCG CGCGACCAGG AAGACTTCCG AGCGGTCGCA GCCTCGTGAC 180

AGGCGACAGC CTATTCCTAA GGCTCGCCAG TCCGATGGCA GNNCCTGGGC TCAGCCAGGG 240

CATCCCTGGC CCCTCTATGG CAATGAGGGC TGCGGATGGG CGGGATGGCT CCTGTCCCCC 300

CGCGGCTCTC GGCCCAGTTG GGGCCCC 327

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 109 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn
1 5 10 15

Arg Arg Pro Gln Asp Val Lys Xaa Pro Gly Gly Gly Gln Ile Val Gly
20 25 30

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45

Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Asp Arg Arg Gln Pro
50 55 60

Ile Pro Lys Ala Arg Gln Ser Asp Gly Xaa Xaa Trp Ala Gln Pro Gly
65 70 75 80

His Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp
85 90 95

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro
100 105

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 447 base pairs



SEQUENCE VERIFICATION REPORT 

PATENT APPLICATION US/08/836,075A 



DATE: 02/04/98 
TIME: 15:55:47 



INPUT SET: S23159.raw

Original Text 



